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ABSTRACT: The energy resolution of a highly granular 1 m 3 analogue scintillator-steel hadronic 
calorimeter is studied using charged pions with energies from lOGeV to 80GeV at the CERN 
SPS. The energy resolution for single hadrons is determined to be approximately 58%/ ^E/GeV. 
This resolution is improved to approximately 45% / y/E/GeV with software compensation tech- 
niques. These techniques take advantage of the event-by-event information about the substructure 
of hadronic showers which is provided by the imaging capabilities of the calorimeter. The en- 
ergy reconstruction is improved either with corrections based on the local energy density or by 
applying a single correction factor to the event energy sum derived from a global measure of the 
shower energy density. The application of the compensation algorithms to GEANT4 simulations 
yield resolution improvements comparable to those observed for real data. 
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1. Introduction 

The physics goals of future high-energy lepton colliders such as the ILC flU] or CLIC [0] put 
stringent requirements on the detector systems. For example, the efficient event-by-event sep- 
aration of heavy bosons in hadronic final states requires a jet energy resolution of better than 
4% OH]. This is achievable with Particle Flow Algorithms (PFA) combined with highly granular 
calorimeters [0, ii 0]. The C ALICE collaboration has constructed and extensively studied highly 
granular electromagnetic and hadronic calorimeter prototypes to evaluate detector technologies for 
future linear collider experiments. These calorimeters have been successfully operated in various 
test beam experiments in different configurations at DESY, CERN and Fermilab from 2006 until 
2012. The unprecedented granularity of the CALICE calorimeter prototypes allows the structure of 
hadronic showers to be studied with high spatial resolution, in order to validate different simulation 
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Figure 1. Top view of the CALICE test beam apparatus in the CERN SPS H6 beam line including calorime- 
ters, trigger components (scintillator triggers SCI, SC2, and SC3; large area muon trigger counters Mcl, 
which was installed only during calibration runs, and Mc2; and the beam halo veto), and the tracking drift 
chambers DC1, DC2, and DC3. The beam enters from the left. Dimensions are in millimetres. Figure is not 
to scale. Positions are given at detector centre. 



models (for one example of such studies see O) and to test particle flow algorithms, as demon- 
strated in fl7J] . The high granularity also offers the possibility for advanced energy reconstruction 
methods, the subject of this paper. 

We present a study of the hadronic energy resolution of the CALICE analogue scintillator- 
steel hadronic calorimeter (AHCAL) flED using data taken at the CERN SPS in 2007 with positive 
and negative pion beams in the energy range from 10 to 80 GeV. Two software compensation 
techniques, which weight energy depositions based on information about the local energy den- 
sity within the shower obtained from the highly granular readout, are discussed in detail. Both 
techniques achieve an improvement of the hadronic energy resolution by approximately 20% for 
single hadrons in the energy range from 10 to 80 GeV, with a reduction of the stochastic term from 
- 58%/i/£/GeV to ~ 45%/ v / £/GeV. 

In Section we briefly describe the test beam setup, discuss the event selection and describe the 
energy reconstruction, calibration and the determination of the energy resolution in the AHCAL. 
The software compensation techniques are presented in Section 0, and Section |] summarises the 
results obtained from data and compares them to simulations. 



2. Energy reconstruction in the AHCAL 
2.1 Test beam setup 

The complete CALICE setup in the H6 beam line at the CERN SPS for the 2007 beam pe- 
riod, illustrated in Figure [I], consisted of a silicon-tungsten electromagnetic sampling calorimeter 
(ECAL) [0], the AHCAL, and a scintillator-steel tail catcher and muon tracker (TCMT) [HO]. The 
test beam setup was also equipped with various trigger and beam monitoring devices. 

The ECAL [0] has a total depth of 24 radiation lengths (approximately 1 nuclear interaction 
length A/) and consists of 30 active silicon layers arranged in three longitudinal sections with dif- 
ferent absorber thicknesses. In this study, the ECAL was used for event selection and early shower 
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detection. Since the present study focuses on the AHCAL, events with a primary inelastic interac- 
tion in the ECAL are rejected, as discussed below. 

The AHCAL [H] consists of small 5 mm thick plastic scintillator tiles with embedded wave- 
length-shifting fiber and individual readout by silicon photomultipliers (SiPMs). The tiles are as- 
sembled in 38 layers with lateral dimensions of 900x900 mm 2 , separated by 21 mm of steel. The 
absorber material in each layer is made of 17 mm thick absorber plates and two 2 mm thick cover 
plates of the cassettes that house the scintillator cells. The size of the scintillator tiles ranges from 
30 x 30 mm 2 in the central region and 60 x 60 mm 2 in the outer region to 120 x 120 mm 2 along 
the perimeter of each layer. In the last eight layers only 60 x 60 mm 2 and 120 x 120 mm 2 tiles are 
used. In total, the CALICE AHCAL has 7608 scintillator cells and a thickness of 5.3 A/ (4.3 X n ). 
To be able to correct off-line for variations in response of the photon sensors caused by the sub- 
stantial temperature variations in the experimental hall, the temperature of the AHCAL inside the 
readout cassettes is monitored in each layer by five sensors installed equally spaced from bottom 
to top in the centre of the detector. 

The TCMT [ITOl consists of 16 readout layers assembled from 5 mm thick, 50 mm wide and 
1000 mm long scintillator strips with embedded wavelength-shifting fibers read out at one end 
by SiPMs. Each of the scintillator layers has 20 strips for a total of 320 strips in the TCMT. 
The scintillator is sandwiched between steel absorber plates. The TCMT has two sections with 
different sampling fractions, one fine section with 21 mm thick absorbers for the first nine layers 
(where the absorber plate for the first layer is the back plate of the AHCAL), and a coarse section 
with 104 mm thick absorbers. Two mm of the absorber thickness in each layer is provided by 
the cover sheets of the scintillator strip cassettes. In this study the information from the TCMT is 
used for muon separation and to measure energy leaking out the back of the AHCAL, which is of 
particular importance at higher energies. The total depth of the CALICE calorimeter setup amounts 
to approximately 12 A/, with a total of 17 648 readout channels. 

In addition to the calorimeters themselves, the setup includes auxiliary detectors for triggering, 
tracking and particle identification as shown in Figure [I]. The scintillation counters Scl, Sc2 and 
Sc3 provide the beam trigger, where a coincidence between at least two out of the three is required. 
In addition, Sc2 has an analogue readout to tag multi-particle events. The large area veto counter is 
used to reject beam halo events and a large area scintillator counter Mc2 downstream of the TCMT 
provides muon tagging for particles penetrating the full calorimeter setup. For dedicated muon 
runs, an additional large area scintillation counter, Mcl, is installed upstream of the calorimeters. 
Three drift chambers DC 1 , DC2 and DC3 determine the position of the incoming beam particles. 
Particle identification is provided by a threshold Cerenkov counter upstream of the calorimeters, 
which discriminates between electrons and pions or between pions and protons in negatively or 
positively charged beams, respectively, by appropriately chosen gas pressures. 

2.2 Event selection 

The response of the individual calorimeter cells is calibrated with muons, using the visible signal 
of a minimum-ionising particle (MIP) as the cell-to-cell calibration scale. This signal corresponds 
to 13 detected photo-electrons in typical cells. After this cell-to-cell calibration, the most probable 
energy loss of a MIP is used as the base unit of the energy measurement. To reject noise, only cells 
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Table 1. Summary of the data samples. The total number of pions is the number of events classified as 
pions, after rejection of empty, noisy and double particle events, and the application of muon rejection and 
particle identification cuts. The number of selected pions are the events with an identified shower start in 
the first five layers of the AHCAL, which are used in the present analysis. For most energies, several run 
periods at different temperatures are combined to maximise statistics. 

with a visible energy above a threshold of 0.5 MIP are used in the analysis, referred to as hits in the 
following. 

The data samples for the present analysis are selected from %~ and % + data in the energy range 
of 10 to 80GeV and 30 to 80GeV respectively, as summarised in Table To maximise statistics, 
data from several run periods taken at different temperatures are combined for most energies, with 
corrections for the temperature dependence of the response of the photon sensors applied during 
event reconstruction [fTTH . The event selection procedure purifies the pion sample by rejecting 
admixtures of muons, electrons, and protons. To identify muons, information from the ECAL, 
AHCAL and TCMT is used, requiring small energy deposits consistent with a minimum-ionising 
particle in all three detectors. Optimal separation of muons and hadrons is achieved by using beam 
energy-dependent constraints on the energy sum in the TCMT versus the combined energy sum of 
the ECAL and AHCAL. For beam energies of 30 GeV and 35 GeV a muon contamination at the 
level of 30% and 15% before muon rejection is observed, respectively. For all other energies the 
muon content does not exceed 7%. After the event selection, the muon content is below 0.5% at all 
energies, estimated using the muon identification efficiency of 98% at 10 GeV and 99.5% at 30 GeV 
and above, which is determined from muon data and simulations. Protons and kaons are removed 
from the 7i + samples by requiring a positive pion identification in the Cerenkov counter. Even 
before selection, the kaon content is below 3% at all energies. The proton content of the beam 
is very small below 30 GeV since a tertiary beam is used at these energies, and varies between 
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approximately 15% and 30% at higher energies. Since the positive identification of K + is based on 
the detection of Cherenkov photons the proton and kaon contamination of the positive pion sample 
is negligible. Electrons are removed from the %~ sample both by the Cerenkov counter and by 
selecting events with no inelastic interaction in the ECAL, as discussed below. 

Since the goal of the present analysis is the study of the performance of the AHCAL, pion 
showers that develop predominantly in the AHCAL are selected. This is achieved by requiring 
that the position of the primary inelastic interaction is located in the first five layers of the hadron 
calorimeter. This excludes events with sizeable energy deposit in the ECAL while keeping en- 
ergy leakage into the TCMT to a minimum. The location of the primary inelastic interaction is 
determined by detecting the change from a minimum-ionising particle track to multiple secondary 
particles, evidenced by increased energy deposition and number of hits over several consecutive 
layers fl7fl. Simulation studies indicate that the difference between the reconstructed and the true 
primary interaction layer does not exceed one layer for 78% of all events and does not exceed two 
layers for more than 90% of all events in the energy range from 10 to 80 GeV. 

2.3 Energy reconstruction and intrinsic energy resolution 

To measure the energy deposited in the sub-detectors, a conversion from the visible signal in MIP 
units to the total energy in units of GeV is necessary. Since only hadrons with a shower start 
in the AHCAL are considered, the relevant conversion factor for the ECAL is determined using 
simulated muons to obtain the correlation between the visible energy and the true ionisation energy 
loss in the detector. This factor is validated with the measured response to muons obtained from 
a sample of muon data. The ratio of the visible signal in the active silicon to the total deposited 
energy in active and passive material is approximately 25% higher for minimum-ionising particles 
than for electromagnetic showers, resulting in a lower conversion factor than that for electrons 
presented in fOl . The total energy deposited in the AHCAL is obtained at the electromagnetic 
scale, using calibration factors determined for electron and positron data [TOl . Since the AHCAL 
is a non-compensating calorimeter, the response to hadrons differs from that to electrons, requiring 
an additional scaling factor. This factor is determined by comparing the reconstructed energy for 
pions using the electromagnetic calibration factors with the known beam energy. In the present 
study, the energy dependence of this factor is ignored by taking a constant ^ = 1.19, corresponding 
to the average over the energy range studied. Since the first nine TCMT layers are essentially 
identical to the AHCAL layers in terms of absorber and active material, the same electromagnetic 
calibration factors and an identical | ratio are assumed. For the last seven TCMT layers, the 
calibration factors are adjusted according to the increased absorber thickness. 

For each event, the uncorrected reconstructed energy for hadrons, E uac , is given by the sum of 
reconstructed energies in the three calorimeters, 

£unc = ^ECAL ~i (-^HCAL + -EtCMt) , (2-1) 

K 

where £ecal * s tne measured energy in the ECAL deposited by the particle track, and £"hcal and 
£tcmt are the energies measured in the AHCAL and in the TCMT, both given at the electromag- 
netic scale. The energy in each subdetector is given by the sum of all hits above a noise threshold 
of 0.5 MIP. 
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Figure 2. Reconstructed energy distributions for 10 GeV n~ (a) and 80 GeV n~ (b) without compensation 
(black circles) and after local software compensation (LC), shown by the blue triangles, and after global 
software compensation (GC), shown by the red squares. The curves show Gaussian fits to the distributions 
in the range of ±2 standard deviations. Errors are statistical only. 



The resulting reconstructed energy distributions are fit with a Gaussian in the interval of ±2 
standard deviations around the mean value, providing good fits for all energies. The differences 
compared to a fit over the full range are on the sub-percent level for the extracted mean and on 
the one percent level for the standard deviation and depend on the beam energy. Fitting over the 
full range reduces the fit quality for some energies in particular for the uncorrected data, leading to 
the choice of ±2 standard deviations for best consistency between the different data points. In the 
following, the mean and standard deviation of this Gaussian fit at a given beam energy are referred 
to as the mean reconstructed energy iSreco and the resolution a reco » respectively. 

Systematic uncertainties on the energy measurement in the AHCAL are discussed in detail in 
lfi"5H . For the reconstruction of hadrons, the main uncertainty is due to the MIP to GeV conversion 
factor that is extracted from the electromagnetic calibration of the detector. The size of the uncer- 
tainty was studied thoroughly for the present data set, and is determined to be 0.9% by varying the 
calibration constants within the allowed limits. Other effects which contribute to the uncertainties 
for electromagnetic showers, such as the saturation behaviour of the photon sensor, are found to be 
negligible for hadrons even at the highest energies studied here. 

Figure @ shows the distribution of reconstructed energies for 10 GeV and 80 GeV pions, with 
the uncorrected reconstructed energy shown by black data points. At all energies, the distributions 
of the reconstructed energies follow a Gaussian distribution well, with typically more than 95% 
of all events in the fit range of ±2 standard deviations. The software compensation methods also 
included in the figure are described in Sections [O] (local software compensation) and (global 
software compensation). 

Figure shows the mean reconstructed energy versus beam energy, with the black points 
giving the uncorrected reconstructed energy. The measured responses to positive and negative 
pions agree well within the systematic uncertainties, which are shown by the green band. Relative 
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Figure 3. (a) Mean reconstructed energy for pions and (b) relative residuals to beam energy versus beam 
energy without compensation (black circles) and after local software compensation (LC), shown by the blue 
triangles, and after global software compensation (GC), shown by the red squares. Filled and open markers 
indicate n~ and n + , respectively. Dotted lines correspond to E KCO = £beam- Systematic uncertainties are 
indicated by the green band, which corresponds to the uncertainties for the uncorrected n~ data sample. 



residuals to the beam energy are shown in the lower panel of Figure 0, where AE = E reco —E\, eam . 
The linearity of the calorimeter response to hadrons showering predominantly in the AHCAL is 
within ±2% in the studied energy range. 

The fractional energy resolution, CJ r eco/^reco> is shown in Figure |]. Again, the uncorrected 
resolution is indicated by black points. The measured resolution for n~ is in very good agreement 
with that obtained for n + , with the differences smaller than the size of the markers for all energies 
where both ii~ and n + results exist. The black solid curve shows the result of a fit to these points 
with the following function: 

P" = ^=®b(B-^, (2.2) 

^reco V E-beam ^beam 

where £beam is the beam energy in GeV, and a, b and c are the stochastic, constant and noise con- 
tributions, respectively. The noise term is fixed to c = 0.18 GeV, corresponding to the measured 
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Figure 4. Energy resolution versus beam energy without compensation and after local and global software 
compensation. The curves show fits using Equation [2. 4 , with the black solid line showing the fit to the 
uncorrected resolution, the red dotted line to the global software compensation and the blue dashed line to 
the local software compensation. The stochastic term is (57.6 ±0.4)%, (45.8 ±0.3)% and (44.3 ±0.3)%, 
with constant terms of (1.6±0.3)%, (1.6±0.2)% and ( 1 .8 ± 0.3)% for the uncorrected resolution, global 
software compensation and local software compensation, respectively. 

noise contribution in the full CALICE setup taking into account contributions from the ECAL 
(0.004 GeV), the AHCAL (0.06 GeV) and the TCMT (0.17 GeV). These values are obtained 
from the standard deviation of the noise levels measured in dedicated runs without beam parti- 
cles as well as in random trigger events constantly recorded during data taking. From the fit, the 
stochastic term of the uncorrected hadron energy resolution of the AHCAL is determined to be 
(57.6 ± 0.4) %/y/E/GeN and the constant term to be (1.6 ±0.3)%. 

3. Software compensation: motivation and techniques 

In ideal sampling calorimeters the energy measured for electromagnetic showers is directly propor- 
tional to the incoming particle energy. In the absence of instrumental effects such as non-linearities 
or saturation of the readout, the energy of a particle can thus be obtained by multiplying the visible 
signal by a single energy-independent factor accounting for the non-measured energy depositions 
in the passive absorber material. 

The calorimeter response to hadron- induced showers is more complicated OT4T1 . since these 
showers have contributions from two different components: an electromagnetic component, origi- 
nating primarily from the production of 7T°s and 17s and their subsequent decay into photon pairs; 
and a purely hadronic component. The latter includes "invisible" components from the energy 
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loss due to the break-up of absorber nuclei, from low-energy particles absorbed in passive material 
and from undetected neutrons, depending on the active material. This typically leads to a reduced 
response of the calorimeter to energy in the hadronic component, and thus overall to a smaller 
calorimeter response to hadrons compared to electromagnetic particles of the same energy. Since 
the production of 7r°s and 17s are statistical processes, the relative size of the two shower compo- 
nents fluctuates from shower to shower, which, combined with the differences in visible signal for 
electromagnetic and purely hadronic energy deposits, leads to a deterioration of the energy resolu- 
tion. In addition, the average fraction of energy in the electromagnetic component depends on the 
number of subsequent inelastic hadronic interactions and thus on the initial particle energy. The 
electromagnetic fraction of hadronic showers increases with increasing particle energy [TT3T1 . often 
resulting in a non-linear response for non-compensating calorimeters. 

There are two fundamentally different approaches to improve the energy resolution of a ha- 
dronic sampling calorimeter. One approach is to eliminate the issue of different response to elec- 
tromagnetic and hadronic components by design, through the construction of so-called compensat- 
ing calorimeters. This can be achieved by specific choices of absorber and active material which 
enhance the sensitivity to neutrons, and thus to the hadronic component of the shower, and by ap- 
propriately chosen sampling fractions. However, these conditions impose very strict requirements 
on the materials used and on the overall geometry of the whole detector system. One promi- 
nent example of a compensating calorimeter is the uranium- scintillator calorimeter of the ZEUS 
experiment HT6L PL which reached a stochastic resolution term of 34.5%/ ^E/GeV for single 
pions flU. 

On the other hand, for intrinsically non-compensating calorimeters, compensation can be 
achieved by so-called "off-line weighting" or "software compensation" techniques. These tech- 
niques assign different weights to electromagnetic and hadronic energy deposits on an event-by- 
event basis. The differing spatial structure of the electromagnetic and hadronic components of par- 
ticle showers can be used to characterise the origin of energy deposits. Since the radiation length 
is much shorter than the nuclear interaction length in heavy absorbers used in hadronic calorime- 
ters, electromagnetic sub-showers are more compact than purely hadronic sub-showers, generally 
resulting in a higher energy density of the electromagnetic component. The application of software 
compensation techniques relies on longitudinal and lateral segmentation of the calorimeters, to pro- 
vide the necessary information for a measurement of the energy density of particle showers. One of 
the first applications of such techniques was in the WA1/CDHS scintillator steel calorimeter, where 
an improvement of the hadronic resolution between 10% and 30% was achieved in the energy 
range of 10 GeV to 140 GeV [TTPTl - These techniques were further refined and applied in various 
experiments, such as the HI liquid argon calorimeter flZOl and the ATLAS calorimeter system flZTH . 

With its unprecedented high granularity, the CALICE AHCAL is well suited for such tech- 
niques. In the present paper, two techniques based on an event-by-event analysis of the hit en- 
ergy distributions are discussed. The local software compensation (LC) procedure is based on a 
re-weighting of each individual hit depending on the local energy density. The global software 
compensation (GC) procedure uses the distribution of hit energies to derive one global factor for 
the correction of the reconstructed energy of the complete hadronic shower. The parameters used 
for both techniques are determined from test beam data, as discussed in detail below. The available 
data set is split into two samples of equal event count, a training data set and the data set used to 
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study the energy reconstruction. This ensures a statistical independence of the data used to deter- 
mine the parameters for the software compensation algorithms and the data used to evaluate the 
performance of the techniques. 

3.1 Local software compensation 

The local software compensation technique improves the energy reconstruction for hadrons by 
applying weights to the energy recorded in every cell of the AHCAL within a hadronic shower. 
The weights are chosen based on the local energy density, which is taken as a measure of the 
likelihood that a given cell belongs to an electromagnetic or a hadronic sub-shower. In the present 
study, the energy content of a cell, divided by its volume, is taken as the relevant local energy 
density. Electromagnetic sub-showers typically have a higher energy density than purely hadronic 
ones, and, due to the non-compensating nature of the AHCAL, result in a larger detector signal 
per unit of deposited energy. Thus, cells with a higher energy content are assigned a lower weight 
in the total energy sum than cells with a low energy content to correct for this difference. The 
reconstructed energy of each event corrected with local software compensation, Z?lc> is thus given 
by introducing weights for each AHCAL hit in Equation [Ol , resulting in 



where ft), is the energy density dependent weight applied to the cell energy ^hcal,;- 

To make the technique robust against fluctuations entering due to the relatively low number of 
hits in a given event, the single cell energy density distribution is subdivided into bins in energy 
density, as illustrated in Figure |5] (a). The binning is also needed for the minimisation technique 
chosen here for the determination of the weights as discussed below. For each bin, a separate 
weight is determined which is applied to all hits that fall into that particular bin. The number 
of sub-divisions in energy density is chosen as a compromise between the requirements for fine 
subdivisions to maximise the sensitivity of the algorithm to differences in shower structure on one 
hand, and the stability of the determination of the weights and of the algorithm on the other hand. 
While a fine binning improves the sensitivity to the shower structure, a robust determination of 
the weights requires sufficient statistics in each bin, and changes of the weights from bin to bin. 
The performance of the local software compensation does not depend on the precise choice of bin 
number and bin borders. 

Since the overall energy density of hadronic showers changes with energy, the weights CO 
depend both on the cell energy density p and on the particle energy. The weights, as a function 
of energy density and particle energy, are determined from the training data set extending over the 
full energy range studied here. The optimal weights are found by minimising a simplified % 2 given 
by the function x 2 = LiC^lc,; — £beam) 2 > where Ej_c,i is the reconstructed energy of a given event 
using software compensation, and the sum runs over all events used for the weight determination. 
In this minimisation, the bin by bin weights are used as free parameters. Figure |5| (b) shows the 
optimal weights determined with this procedure for four different energies. The weights at a given 
beam energy can be parametrized by 




(3.1) 



oo = po + pi-exp(p 2 -p) 



(3.2) 
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Figure 5. (a) Distribution of the cell energy density in the AHCAL for 40 GeV pion showers. The energy 
density is given by the uncorrected reconstructed energy in that cell, divided by the corresponding absorber 
volume. The different energy density bins used in the analysis are indicated by colour shades, (b) Optimal 
weights as a function of energy density for different beam energies, determined without constraints of a 
specific functional form in the first iteration of the minimisation. 



where p is the energy density corresponding to the centre of the energy density bins introduced 
above, and po,pi and p2 are parameters of the weight function. These parameters depend on the 
beam energy, with their energy dependence following exponential functions in particle energy for 
po and p\, and a logarithmic function in particle energy for p2- A robust determination of the 
weights is achieved by an iterative minimisation procedure, where the free parameters po, p\ and 
P2 are consecutively fixed to the function determined in the previous minimisation stage. 

For the application of this technique to data, no a priori knowledge of the particle energy is 
required, as the uncorrected reconstructed particle energy is used instead of £"beam to select the 
correct weight parametrisation. Since the energy dependence of the weight parameters is not very 
steep, this does not introduce a noticeable bias for the reconstructed energy. A second iteration 
does not lead to significant further improvement and is thus not performed in the reconstruction. 

3.2 Global software compensation 

The global software compensation technique improves the energy resolution for hadrons by apply- 
ing a single weight to the reconstructed shower energy. This weight is derived from the distribution 
of hit energies in the hadronic shower, providing sensitivity to the overall energy density, and thus to 
the fraction of hits in electromagnetic sub-showers. Since electromagnetic sub-showers are charac- 
terised by a high local energy density, a hadronic shower with a large electromagnetic content will 
have a larger fraction of high-energy hits than a shower with predominantly hadronic contributions. 

The determination of the event weight is based on a phenomenological approach using the 
fraction of calorimeter hits below a certain energy threshold. This fraction is computed for each 
event, and provides a measure of the importance in each shower of low-density energy deposits, 
which are expected to be predominantly of hadronic origin. Based on this, with an additional con- 
sideration of the overall hit energy distribution given by the number of hits below the mean energy 
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Figure 6. Correlation of the factor C g i b a i and the reconstructed energy in the AHCAL, Ehcal, f° r showers 
induced by n + at 30 GeV. 

value of the hit energy, the factor C g i b a i is constructed, which is used to correct the reconstructed 
energy. This factor, calculated for each event, is given by the ratio of the number of shower hits 
with a measured visible signal below a given threshold eu m and the number of shower hits with a 
measured visible signal below the mean value of the hit energy spectrum for that particular event. 
Figure ^| illustrates the sensitivity of the factor C g i b a i to the reconstructed energy, for a value of 
en m = 5 MIP applied to n + events at 30 GeV. The clear anti-correlation between the reconstructed 
energy and C g i b a i provides the basis for an improved energy reconstruction using this factor. The 
anti-correlation is due to the fact that events with a high electromagnetic content tend to have a 
larger number of high-energy hits above e\ ira and thus a lower C g i bai> while those events have a 
higher reconstructed energy. 

The value of e\\ m was optimised to provide good performance of the algorithm over the full 
energy range, with the linearity of the detector response taken as a key factor. While higher values 
for eij m provide stricter separation of electromagnetic and non-electromagnetic events, if the value 
is set too high this results in asymmetric distributions of C g i bai at lower energy, leading to reduced 
performance. These asymmetries originate from the reduced number of high-energy hits at low 
particle energies. For example, a large fraction of 10 GeV pion showers have essentially no hits 
above 7 MIP. Too low values, on the other hand, result in a non-linear response due to the reduced 
sensitivity to the electromagnetic component at higher particle energies. Best performance was ob- 
tained for a value of eu m = 5 MIP. For the highly granular core of the calorimeter, this corresponds 
to an energy density of 7.4 GeV/1000 cm 3 in Figure |5] (a). For the energy range studied, the mean 
hit energy is between 2.7 to 4.7 MIP. Figure [7] shows the distributions of C g i b a i for different ener- 
gies, demonstrating its energy dependence, originating from the change of the overall hit energy 
spectrum with changing particle energy. When applying C g i b a i in the energy reconstruction, this 
dependence has to be corrected for, as discussed below. 
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Figure 7. Distributions of the factor C g ] ba] for hadronic showers induced by % with an energy of 10 GeV 
(blue squares), 35 GeV (black circles) and 80 GeV (red triangles), respectively. Statistical errors are shown. 



The reconstructed energy with global software compensation is obtained in two steps. First, a 
corrected shower energy is calculated by multiplying the reconstructed energy in the AHCAL and 
in the TCMT by the factor C g i oba i, giving E S hower = Cgiobai ■ (£hcal + £tcmt)- From this corrected 
shower energy, the final reconstructed energy with global software compensation for a given event, 
£gc> i s tnen obtained from 

£gc — -^ECAL + -^shower " ^global (-^shower) > (3.3) 

where /global (^shower) is a function which accounts for the energy dependence of the compensa- 
tion parameters, visible in Figure [7] as the shift of the mean of C g i b a i with energy. This func- 
tion depends on the corrected shower energy Shower and is given by a second-order polynomial, 
^global (^shower) = «o + a i • Shower + «2 ■ Shower- Tne parameters for this function are obtained from 
a fit of the dependence of the corrected shower energy Shower on the true deposited energy given 
by the beam energy corrected for the energy deposited in the ECAL, and are extracted from a 
training data set extending over the full energy range considered here. They are found to be 
a Q = 0.982 ± 0.007, a\ = 0.0041 ±0.0003 GeV" 1 and a 2 = (-2.2 ± 0.3) • 10~ 5 GeV 2 . 

The application of the global software compensation technique does not require knowledge 
of the beam energy, since the energy reconstructed in the HCAL and TCMT is used also in the 
determination of the correction of the energy dependence of the compensation parameters. 

4. Results 

To evaluate their performance, both software compensation techniques are applied to test beam 
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data and to simulated data. The parameters for the algorithms are determined using test beam data 
following the training procedures outlined above. 

4.1 Application of software compensation to test beam data 

When applying the software compensation techniques to test beam data, the energy dependent 
compensation factors are determined event-by-event using the uncorrected reconstructed energy. 
Figure @ shows the distribution of reconstructed energies for the uncorrected reconstruction com- 
pared with both software compensation techniques studied. The results are shown for pions with 
energies of 10 GeV and 80 GeV. In both cases, the software compensation algorithms improve 
the energy resolution, evidenced by a narrowing of the distributions, while preserving or even im- 
proving the Gaussian form of the distributions. The algorithms also bring the mean value of the 
reconstructed energy closer to the beam energy, resulting in small shifts of the maxima visible in 
Figure @. The mean reconstructed energy with local and global compensation techniques, com- 
pared to the uncorrected response without compensation, is shown in Figure |3] for all energies 
studied. For both techniques, all points fall within ±1.5% of linearity. 

The energy resolution before and after compensation is shown in Figure |]. Good agreement 
between the 7i~ and % + samples is observed. The energy dependence of the energy resolution 
is well described by Equation \T3\ with a fixed noise term c = 0. 1 8 GeV as discussed in Section 

The fit results are summarised in Table 0. The application of software compensation results 
in a decrease of the stochastic term while the constant term remains unchanged. Both compensa- 
tion techniques show very similar performance, with the local software compensation providing a 
slightly smaller stochastic term, and slightly better performance at intermediate energies. 

Table 2. Stochastic, constant and noise term contributions to the resolution of the CALICE AHCAL deter- 
mined with a fit of Equation to data. 





a [%] 


b [%] 


c [GeV] 


uncorrected 


57.6±0.4 


1.6±0.3 


0.18 


local compensation 


44.3±0.3 


1.8±0.2 


0.18 


global compensation 


45.8±0.3 


1.6±0.2 


0.18 



Figure |S| shows the relative improvement of the energy resolution achieved with the software 
compensation techniques, defined as the ratio of the resolution after software compensation Osc 
(local or global) to the uncorrected resolution a unc - The improvement ranges from ~12% to ~25% 
in the energy range studied, for both techniques, with approximately 3% better relative improve- 
ment observed for the local technique in the energy range from 25 GeV to 60 GeV. The reduced 
performance at high energy is partially due to increased leakage into the TCMT. Energy deposits 
in the TCMT are not weighted in the local software compensation since their energy density is not 
well defined. In the global software compensation, the weight is applied also to TCMT energy de- 
posits, but those are not considered in the determination of the weighting factor due to the different 
readout geometry which leads to increased uncertainties in the weight determination. 
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Figure 8. Energy dependence of the relative improvement of the resolution with local and global software 
compensation observed for data. Where available, results for n~ and n + are averaged for clarity. 



4.2 Comparison to Monte Carlo simulations 

The stability of both software compensation techniques, as well as the realism of simulation mod- 
els, is tested using Monte Carlo simulations. For this purpose, the software compensation algo- 
rithms with coefficients derived from data are applied to Monte Carlo samples generated with a de- 
tailed detector model in GEANT4.9.4 Q32] using two physics lists: QGSP_BERT and FTF_BIC QZJ. 
The QGSP_BERT physics list was chosen because it is the most widely used model in high energy 
physics experiments at present. The FTF_BIC physics list, in turn, has provided good results in a 
previous CALICE analysis [Q] and is completely independent from QGSP_BERT. 

Details on the simulation procedure for the AHCAL can be found in HT3H . For the chosen 
physics lists, samples of 7T + and %~ events were simulated at the same energies as the data points 
using beam profiles, detector temperatures and voltage settings from the data runs. The calibration 
of the simulation was performed at the MIP level by converting the simulated energy deposits in the 
scintillator into MIPs using the most probable energy loss of muons determined in simulations. The 
simulated data sets were passed through the same event selection and reconstruction procedures as 
real data, using the conversion factors from the MIP scale to reconstructed energy determined 
for data as discussed in Section [Oj This also includes the requirement for an identified primary 
inelastic interaction in the first five layers of the AHCAL. 

The uncorrected reconstructed energy as a function of beam energy is shown for data and both 
physics lists in Figure (a). The relative deviation from the beam energy, shown in Figure |9| (b), 
indicates that simulations with both physics lists behave differently than the data. Both models 
overestimate the reconstructed energy at high particle energies. In addition, QGSP_BERT exhibits 



-15- 



ffl 90 
O 

8 80 

a> 
Li] 



70 ; 
60 E- 

50^ 
40 : 
30 
20 
10 



« 0.06 

ty o.o4 

< 0-02 


-0.02 
-0.04 



T 



Data: 7i" 
Data:7r + 

QGSP_BERT:tT 
QGSP_BERT:ti + 
FTF_BIC:7f 
FTF BIC:ti + 



(a) 



CALICE 




10 20 30 40 50 60 70 80 90 



"beam 



[GeV] 



Figure 9. (a) Uncorrected response to pions and (b) relative residuals to beam energy versus beam energy 
for data (black circles), QGSP_BERT (red squares) and FTF_BIC (blue triangles). Filled and open markers 
indicate n~ and n + , respectively. Dotted lines correspond to Zsreco = £beam> while the green band shows 
systematic uncertainties for the uncorrected %~ data sample. 



fluctuations in the transition region between different models in the region between 10 GeV and 
20 GeV. In general, the reconstructed energy for simulations is less linear than for data. 

Figure |T^ shows the energy resolution without software compensation, comparing data and 
simulations. Again, the behaviour of the simulations is different from that of the data, with both 
models underestimating the resolution at low energy, and with FTF_BIC overestimating the resolu- 
tion above 30 GeV. This difference leads to a reduced stochastic resolution term with a significantly 
increased constant term. The results of the fits to the data points using Equation ^2] are summarised 
in Table Q 

The shape of the distribution of the hit energy density is quite well reproduced by both physics 
lists, providing the basis for an application of the software compensation algorithms to simulations 
using the parameters determined from data. Figure |TT] shows the distributions for two representative 
energies. The distributions are normalized to the number of entries to show the overall shape while 
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Figure 10. Uncorrected energy resolution versus beam energy for data as well as simulations using the 
physics lists QGSP_BERT and FTF_BIC. The curves show fits using Equation [2. 4 The stochastic terms are 
(57.6 ±0.4)%, (51.8 ±0.3)% and (49.4 ±0.3)%, with constant terms of (1.6±0.3)%, (4.0±0.1)% and 
(6.1 ±0.1)% for data, QGSP_BERT and FTF_BIC, respectively. 




Figure 11. Energy density distribution for data and the two physics lists QGSP_BERT and FTF_BIC for (a) 
10 GeV and (b) 60 GeV. The energy density is given by the uncorrected reconstructed energy in that cell, 
divided by the corresponding absorber volume. The distributions are normalized to the number of entries 
(number of hits). 
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Figure 12. Detector response to pions with software compensation comparing data and simulations. For 
both data and simulations compensation parameters derived from data are used, (a) Response with local 
software compensation and (b) corresponding relative residuals to beam energy, (c) Response with global 
software compensation and (d) corresponding relative residuals to beam energy. 



ignoring differences in the normalisation originating from different energy sums. The distributions 
show a slight overestimation of the fraction of high-density hits by the simulations. In addition, 
uncertainties in the treatment of saturation effects of the photon sensor lead to an excess of cells 
with very high energy content well beyond the range shown in the figure. 

The effect of the application of the software compensation algorithms, with parameters ex- 
tracted from data, on the reconstructed energy in simulations is shown in Figure 0. For both 
compensation techniques, the underestimation of the detector response at low energy, in particular 
by the QGSP_BERT physics list, remains present. At intermediate energies from 20 GeV up to 
50 GeV, the application of software compensation results in an improved response linearity and 
in a better agreement between data and simulations for both physics lists considered. At higher 
energy, a significant overestimation of the reconstructed energy by simulations is seen with local 
software compensation, while the global software compensation technique successfully corrects 
the non-linearity of the simulations in that energy regime. This difference in behaviour is partially 
due to uncertainties in the treatment of saturation effects in simulations, and potentially also re- 
ceives a contribution from imperfect descriptions of the shower structure by the shower models 
themselves. In the simulations, the number of cells with very high energy content is overestimated 
and exhibits a longer tail than in data, as discussed above. This affects the correction factor of the 
global software compensation by construction, resulting in a lower shower weight for simulations 
compared to data at the same energy on average, bringing data and simulations into better agree- 
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Figure 13. Energy resolution for pions with local (a) and global (b) software compensation comparing data 
and simulations. For both data and simulations compensation parameters derived from data are used. The 
curves show fits using Equation |2.2| . The fit results for the local software compensation are (44.3 ± 0.3)%, 
(42.3 ±0.2)% and (40.4 ±0.3)% for the stochastic term, with constant terms of (1.8 ±0.2)%, (2.5 ±0.1)% 
and (3.4 ±0.1)% for data, QGSP_BERT and FTF_BIC, respectively. For the global software compensation, 
the results are (45.8 ±0.3)%, (43.6 ±0.2)% and (43.4 ±0.3)% for the stochastic term, with constant terms 
of (1 .6 ± 0.2)%, (0.0 ± 0.2)% and (1 . 1 ± 0.2)% for data, QGSP_BERT and FTF_BIC, respectively. 



Table 3. Fit results using the function given in Equation ( |2.2| ) for simulations with and without software 
compensation, compared to the corresponding values for data. 





a [%] 


b [%] 


c [GeV] 


uncorrected data 


57.6±0.4 


1.6±0.3 


0.18 


uncorrected QGSP_BERT 


51.8±0.3 


4.0±0.1 


0.18 


uncorrected FTF_BIC 


49.4±0.3 


6.1 ±0.1 


0.18 


local compensation data 


44.3±0.3 


1.8±0.2 


0.18 


local compensation QGSP_BERT 


42.3±0.2 


2.5±0.1 


0.18 


local compensation FTF_BIC 


40.4±0.3 


3.4±0.1 


0.18 


global compensation data 


45.8±0.3 


1.6±0.2 


0.18 


global compensation QGSP_BERT 


43.6±0.2 


0.0±0.2 


0.18 


global compensation FTF_BIC 


43.4±0.3 


1.1±0.2 


0.18 



ment. The local software compensation technique applies constant weights for very high-energy 
hits, as can be seen in Figure |5] (b). It is thus less sensitive to these differences between data and 
simulations and preserves the discrepancy in visible energy for high beam energies. 

Figure [T3| shows the energy resolution for simulations compared to that for data for both soft- 
ware compensation techniques. The local software compensation largely preserves the differences 
between data and simulations for the physics list QGSP_BERT, but results in a better agreement of 
FTF_BIC with data, in agreement with the behaviour observed for the reconstructed energy. The 
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Figure 14. Energy dependence of the relative improvement of the resolution for data and simulations using 
the physics lists QGSP_BERT and FTF_BIC, (a) with local software compensation and (b) with global 
software compensation. Where available, results for n~ and n + are averaged for clarity. 



global software compensation brings the overall trend of the resolution with energy for data and 
simulations into good agreement, with better resolution seen for simulations with both physics lists 
than for data. The results of the fits to the data points using Equation are summarised in Table 0, 
together with the results obtained without software compensation. 

The relative improvement in resolution compared to the uncorrected energy resolution is 
shown in Figure [H] for data and simulations. For the local software compensation, the improve- 
ment with respect to energy observed in data is well reproduced by the QGSP_BERT physics list. 
For FTF_BIC, a considerably bigger improvement is seen for the simulations at high energy than is 
seen in data. This higher improvement at high energies results in the better agreement of the energy 
resolution in data and in simulations discussed above. For the global compensation approach, the 
behaviour up to 30 GeV is well modelled by QGSP_BERT, while an up to 20% higher improve- 
ment, compared to that for data, is seen in simulations at the highest energies considered. The 
reason for this different behaviour of local and global software compensation is the same as for the 
different high-energy behaviour for the reconstructed energy as discussed above. 

5. Conclusion 

The hadronic energy resolution of the CALICE analogue hadron calorimeter is studied using test 
beam data collected in 2007 at the CERN SPS. The calorimeter, with an instrumented volume of 
approximately 1 m 3 and a depth of 5.3 A/, is highly segmented in both longitudinal and lateral 
direction, with a total of 7608 electronic channels. The intrinsic energy resolution of the CALICE 
AHCAL for hadrons is measured to be 58%/ y/E/GeV, with a constant term of 1.6%. 



-20- 



The unprecedented granularity of the CALICE AHCAL provides excellent possibilities for the 
application of software compensation algorithms to improve the energy resolution of the calorime- 
ter based on event-by-event information on the energy density structure of the showers. Two tech- 
niques have been presented here, together with results from test beam and from simulated data 
samples. The local software compensation technique uses local energy density information for 
a cell-by-cell re-weighting of energy deposits, while the global software compensation technique 
uses the distribution of cell energies to derive one overall weighting factor for each shower. Both 
techniques show similar performance, with a relative improvement of the energy resolution ranging 
from 12% to 25% over the studied energy range from 10 GeV to 80 GeV, resulting in a reduction 
of the stochastic term to 45%/ /GeV. In GEANT4 simulations with the QGSP_BERT and the 
FTF_BIC physics lists, the detector response is considerably more non-linear than in data. The 
physics list QGSP_BERT provides a satisfactory description of the energy resolution. The appli- 
cation of software compensation using parameters determined from data brings the resolution into 
better agreement with data. Here, the improvement of the energy resolution using the local soft- 
ware compensation technique observed for the QGSP_BERT physics lists is comparable to that 
observed for data, while larger differences are observed for FTF_BIC and for the global software 
compensation technique. 

Neither of the described techniques requires an a priori knowledge of the particle energy. 
The energy dependent compensation factors are selected based on the uncorrected reconstructed 
energy. Although this energy dependence places some restrictions on the implementation of both 
techniques in a collider environment with a high particle density in hadronic jets, their application 
in the context of particle flow algorithms should be possible based on identified calorimeter clusters. 
The jet energy resolution can profit from the improved hadronic energy resolution directly through 
a better measurement of the neutral hadronic component, but also indirectly from an improved 
matching of reconstructed tracks and calorimeter energy during the clustering phase. 
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